WO 99/16884 



1 / 17 



PCT/EP98/06040 



Figure 1 : A/ Map of plasmidpRIT 14586 




B/ Coding sequence of the first 127 amino acids 
of protein D and multiple cloning site. The signal 
sequence is underlined. 

BamHI 

ATG GAT CCA AAA ACT TTA CCC CTT TCT TTA TTA GCA GCT GCC GTA CTA GCA GGT TGT AGC AGC 

Met Aip Pro Lyi Thr Leu Ala Leu Scr Leu Leu Ala AJa Gly VaJ Leu Ala Gly Cys Scr Ser 

CAT TCA TCA AAT ATG GCG AAT ACC CAA ATG AAA TCA GAC AAA ATC ATT ATT GCT CAC CGT GGT 

His Ser Ser Aw Met Alt Asn Thr Gbi Met Lys Ser Asp Lys lie He He Ala His Arg Gly 

GCT AGC GGT TAT TTA CCA GAG CAT ACG TTA GAA TCT AAA GCA CTT GCT TTT GCA CAA CAG GCT 

Ala Ser Gly Tyr Leu Pro Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gin Gin Ala 

GAT TAT TTA GAG CAA GAT TTA GCA ATG ACT AAG GAT GGT CGT TTA GTG GTT ATT CAC GAT CAC 

Asp Tyr Leu Glu Gin Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val He His Asp His 

TTT TTA GAT GCC TTG ACT GAT GTT GCG AAA AAA TTC CCA CAT CGT CAT CGT AAA GAT GGC CGT 

Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe Pro His Arg His Arg Lys Asp Gly Arg 

TAC TAT GTC ATC GAC TTT ACC TTA AAA GAA ATT GAA AGT TTA GAA ATG ACA GAA AAC TTT GAA 

Tyr Tyr Val lie Asp Phe Thr Leu Lys Glu lie Gin Ser Leu Glu Met Thr Glu Asn Phe Glu 

Ncol jp C | Xh*l 

ACC ATG GCC ACG TGT GAT CAG AGC TCA ACT AGT GCA CAC CAT CAC CAT CAC CAT TAA TCTaGa 

Thr Met Ala Thr Cys Asp Gin Ser Scr Thr Ser Gly His His His His His His • 



The amino acid sequence of Figure 1 relates to Seq. ID no. 7 and the nucleic acid sequence of 
Figure 1 relates to Seq. ID. No. 6. 
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The DNA and amino acid sequences of Nef-His; Tat-His; Nef-Tat-His fusion and 
mutated Tat is illustrated. 

Pichia-expressed constructs (plain constructs) 
=> Nef-HIS 

DNA sequence (Seq. ID. No. 8) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGA 
ATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAA 
AAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGG 
CTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTA 
AGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGG 
GGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATC 
TACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTC 
AGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAG 
GTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCAT 
GGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCA 
TTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGC 
CACCATCACCATCACCATTAA 

Protein sequencefSeq. ID. No. 9) 



MGGKWS KSS WGWPTVRERMRRAE PAADGVGAASRDLEKHGAI TS SNTAATNAACAW 
LEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWI 
YHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLH 
GMD.DPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSGHHHHHH. 



Tat- HIS 

DNA sequence (Seq. ID. No. 10) 



ATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAA 
ACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTC 
ATAACAAAAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGA 
CCTCCTCAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAA 
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TCCCGAGGGGACCCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCAT 
TAA 

Protein sequence (Seq. ID. No. 11) 

MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFITKALGISYGRKKRRQRRR 
PPQGSQTHQVSLSKQPTSQSRGDPTGPKETSGHHHHHH. 



=> Nef-Tat-HIS 

DNA sequence (Seq. ID. No. 12) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGA 
ATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAA 
AAACATGGAGCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGG 
CTAGAAGCACAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTA 
AGACCAATGACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGG 
GGACTGGAAGGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATC 
TACCACACACAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTC 
AGATATCCACTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAG 
GTAGAAGAGGCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCAT 
GGAATGGATGACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCA 
TTTCATCACGTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAG 
CCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCT 
TGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACA 
AAAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCT 
CAAGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGA 
GGGGACCCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCATTAA 

Protein sequencefSeq. ID. No. 13) 



MGGKWS KS S WGWPTVRERMRRAE PAADGVGAAS RDLEKHGAI TS SNTAATNAACAW 
LEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWI 
YHTQGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLH 
GMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTA 
CTNCYCKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSR 
GDPTGPKETSGHHHHHH. 



E.coli-expressed constructs (fusion constructs) 
LipoD-Nef-HIS 
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DNA sequence (Seq. ID. No. 14) 

Nucleotides corresponding to the Prot D Fusion Partner are in bold. 
The Lipidation Signal Sequence is underlined. After processing, the cysteine 
coded by the TGT codon, indicated with a star, becomes the amino terminal 
residue which is then modified by covalently bound fatty acids. 

-% 

ATGGATCCAAAAACTTTAGCCCTTTCTTTATTAGCAGCTGGCGTACTAGCAGGT TGT 

AGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAAATCATTATT 

GCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCTAAAGCACTT 

GCTTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACTAAGGATGGT 

CGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTTGCGAAAAAA 

TTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTTACCTTAAAA 

GAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGCAAGTGGTCA 

AAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCA 

GCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACA 

AGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAG 

GAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG 

GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 

CACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 

TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTT 

GGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAA 

GGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAG 

AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGA 

GAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGCCACCATCACCATCACCAT 
TAA 

Protein sequence ofthe proc essed lipidated ProtD-Nef-HIS protein (Seq. ID. No. 15) 
(Amino-acids corresponding to Prot D fusion partner are in bold) 



CS SHSSNMANTQMKSDKI I IAHRGASGYLPEHTLESKALAFAQQADYLEQDLAMTKD 
GRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGGKW 

SKSSWGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQE 
EEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQG 
YFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDP 
ERE VLE WR FDS RLAFHHVARE LH P E Y F KNCT S GHHHHHH . 



=> LipoD-Nef-Tat-HIS 

DNA sequence (Seq. ID. No. 16) 
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Nucleotides corresponding to the Prot D Fusion Partner are in bold. 
The Lipidation Signal Sequence is underlined. After processing, the cysteine 
coded by the TGT codon, indicated with a star, becomes the amino terminal 
residue which is then modified by covalently bound fatty acids. 

ATGGATCCAAAAACTTTAGCCCTTTCTTTATTAGCAGCTGGCGTACTAGCAGGT TGT 
AGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAAATCATTATT 
GCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCTAAAGCACTT 
GCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACTAAGGATGGT 
CGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTTGCGAAAAAA 
TTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTTACCTTAAAA 
GAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGCAAGTGGTCA 
AAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCA 
GCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGAGCAATCACA 
AGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCACAAGAGGAG 
GAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACTTACAAG 
GCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATT 
CACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACACAAGGCTAC 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCACTGACCTTT 
GGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAA 
GGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGATGACCCTGAG 
AGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCACGTGGCCCGA 
GAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTAGACTA 
GAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTGT 
AAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAAAAGCCTTAGGCATCTCC 
TATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGACTCAT 
CAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCCG 
AAGGAAACTAGTGGCCACCATCACCATCACCATTAA 



Protein sequence of the pro cessed lipidated ProtD-NEF-TAT-HIS protein (Sea. ID. No. 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

CSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQQADYLEQDLAMTKD 
GRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGGKW 

SKSSWGWPTVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAATNAACAWLEAQE 

EEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQG 

YFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDP 

EREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTACTNCY 

CKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSRGDPTG 
PKETSGHHHHHH . 
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=> ProtD-Nef-HIS 

DNA sequence (Seq. ID. No. 18) 

Nucleotides corresponding to the Prot D Fusion Partner are in bold. 

ATGGATCCAAGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAA 

ATCATTATTGCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCT 

AAAGCACTTGCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACT 

AAGGATGGTCGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTT 

GCGAAAAAATTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTT 

ACCTTAAAAGAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGC 

AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGA 

GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGA 

GCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCA 

CAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATG 

ACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAA 

GGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACA 

CAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCA 

CTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAG 

GCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGAT 

GACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCAC 

GTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGGCCACCATCAC 
CATCACCATTAA 



Protein sequence (Seq. ID. No. 19) 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

MDPSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQQADYL 
EQDLAMTKDGRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLK 

EIQSLEMTENFETMGGKWSKSSVVGWPTVRERMRRAEPAADGVGAASRDL 

EKHGAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSH 

FLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPGPGVRYPLTFGW 

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFH 

HVARELHPEYFKNCTSGHHHHHH 



rz> ProtD-Nef-Tat-HIS 

DNA sequence (Seq. ID. No. 20) 



WO 99/16884 PCT/EP98/06040 

7/17 

Nucleotides corresponding to the Prot D Fusion Partner are in bold. 

ATGGATCCAAGCAGCCATTCATCAAATATGGCGAATACCCAAATGAAATCAGACAAA 

ATCATTATTGCTCACCGTGGTGCTAGCGGTTATTTACCAGAGCATACGTTAGAATCT 

AAAGCACTTGCGTTTGCACAACAGGCTGATTATTTAGAGCAAGATTTAGCAATGACT 

AAGGATGGTCGTTTAGTGGTTATTCACGATCACTTTTTAGATGGCTTGACTGATGTT 

GCGAAAAAATTCCCACATCGTCATCGTAAAGATGGCCGTTACTATGTCATCGACTTT 

ACCTTAAAAGAAATTCAAAGTTTAGAAATGACAGAAAACTTTGAAACCATGGGTGGC 

AAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTACTGTAAGGGAAAGAATGAGACGA 

GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAACATGGA 

GCAATCACAAGTAGCAATACAGCAGCTACCAATGCTGCTTGTGCCTGGCTAGAAGCA 

CAAGAGGAGGAGGAGGTGGGTTTTCCAGTCACACCTGAGGTACCTTTAAGACCAATG 

ACTTACAAGGCAGCTGTAGATCTTAGCCACTTTTTAAAAGAAAAGGGGGGACTGGAA 

GGGCTAATTCACTCCCAACGAAGACAAGATATCCTTGATCTGTGGATCTACCACACA 

CAAGGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCAGATATCCA 

CTGACCTTTGGATGGTGCTACAAGCTAGTACCAGTTGAGCCAGATAAGGTAGAAGAG 

GCCAATAAAGGAGAGAACACCAGCTTGTTACACCCTGTGAGCCTGCATGGAATGGAT 

GACCCTGAGAGAGAAGTGTTAGAGTGGAGGTTTGACAGCCGCCTAGCATTTCATCAC 

GTGGCCCGAGAGCTGCATCCGGAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGAT 

CCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGCCTAAAACTGCTTGTACCAAT 

TGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACAAAAGCCTTA 

GGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGT 

CAGACTCATCAAGTTTCTCTATCAAAGCAACCCACCTCCCAATCCCGAGGGGACCCG 

ACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATCACCATTAA 



Protein sequence (Sea. ID. No. 21) 

(Amino-acids corresponding to Prot D fusion partner are in bold) 

MDPS SHS SKMANTQMKSDKI 1 1 AHRGAS GYLPEHTLE SKALAFAQQAD YLEQDL AMT 
KDGRLWIHDHFLDGLTDVAKKFPHRHRKDGRYYVIDFTLKEIQSLEMTENFETMGG 

KWS KS S WGWP TVRERMRRAEPAADGVGAASRDLEKHGAI TS SNTAATNAACAWLE A 

QEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHT 

QGYFPDWQNYTPGPGVRYPLTFGWCYKLVPVEPDKVEEANKGENTSLLHPVSLHGMD 

DPEREVLEWRFDSRLAFHHVARELHPEYFKNCTSEPVDPRLEPWKHPGSQPKTACTN 

CYCKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSRGDP 
TGPKETSGHHHHHH . 



=> Tat-MUTANT-HK 



DNA sequence (Seq. ID. No. 22) 
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ATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATC 4 0 
CAGGAAGTCAGCCTAAAACTGCTTGTACCAATTGCTATTG 8 0 
TAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATAACA 120 
G CTGCCTT AGGCAT CT C CT ATGG CAGGAAGAAG CGGAGAC 160 
AGCG ACGAAGAC CT C CT CAAGGC AGTC AGACTCATC AAGT 200 
TTCTCTATCAAAGCAACCCACCTCCCAATCCAAAGGGGAG 24 0 
CCGACAGGCCCGAAGGAAACTAGTGGCCACCATCACCATC 280 
ACCATTAA 288 



Protein sequencefSeq. ID. No. 23) 

Mutated amino-acids in Tat sequences are in bold. 

MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFIT 4 0 
AALGISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSKGE 80 

PTGPKETSGHHHHHH . 9 5 

=>Nef-Tat~Mutant-HIS 

DNA sequencefSeq. ID. No. 24) 

ATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGC 4 0 
CTACTGTAAGGGAAAGAATGAGACGAGCTGAGCCAGCAGC 8 0 
AGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAAAC AT 120 
GGAGCAATCACAAGTAGCAATACAGCAGCT AC CAATGCTG 160 
CTTGTGCCTGGCTAGAAGCACAAGAGGAGGAGGAGGTGGG 200 
TTTTCCAGTCACACCTCAGGTACCTTTAAGACCAATGACT 24 0 
TACAAGG C AGCTGTAGAT CTTAGC C ACTTTTTAAAAG AAA 280 
AGGGGGGACTGGAAGGGCTAATTCACTCCCAACGAAGACA 320 
AGAT ATC CTTGAT CTGTGGATCT AC CAC ACACAAGGCTAC 360 
TTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGGTCA 400 
GATATCC ACTGA£CTTTGGATGGTGCTACAAGCTAGT ACC 440 
AGTTGAGCCAGATAAGGTAGAAGAGGCCAATAAAGGAGAG 4 80 
AAC AC C AGCTTGTT AC AC CCTGTGAG C CTGC ATGG AATGG 520 
ATGAC C CTGAGAGAGAAGTGTT AGAGTGGAGGTTTG ACAG 560 
CCGCCTAGCATTTCATCACGTGGCCCGAGAGCTGCATCCG 600 
GAGTACTTCAAGAACTGCACTAGTGAGCCAGTAGATCCTA 640 
GACTAGAGC CCTGG AAGCAT CCAGG AAGT CAG C CT AAAAC 680 
TGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCAT 720 
TGCCAAGTTTGTTTCATAACAGCTGCCTTAGGCATCTCCT 760 
ATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCA 8 00 
AGGCAGTCAGACTCATCAAGTTTCTCTATCAAAGCAACCC 840 
ACCTCCCAATCCAAAGGGGAGCCGACAGGCCCGAAGGAAA 880 
CTAGTGGCCACCATCACCATCACCATTAA 90 9 
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Protein sequence (Seq. ID. No, 25) 

Mutated amino-acids in Tat sequence are in bold. 

MGG KWS KS S VVG WPTVRERMRRAE P AADG VG AAS RDLE KH 4 0 
GAITSSNTAATNAACAWLEAQEEEEVGFPVTPQVPLRPMT 8 0 
YKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGY 12 0 
FPDWQNYTPGPGVRYPLTFGWCYKLVP VEPDKVEEANKGE 160 
NTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHP 200 
EYFKNCTSEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFH 24 0 
CQVCFITAALGI SYGRKKRRQRRRPPQGSQTHQVSLSKQP 280 

TSQSKGEPTGPKETSGHHHHHH . 3 02 



4 
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. 5 SDS-PAGE: Nef-Tat-his fusion protein 



1 2 3 4 5 6 7 




Coomassie blue G250 



1: MW (175/83/62,5/47,5/32,5/25/16,5/6,5 kDa) 

2: TNH/23 SP eluate (4 ug) 

3: TNH/23 Superdex200 eluate (4 ug) 

4: TNH/23 Purified bulk (4 ug) 

5: TNH/22 Purified bulk (4 ug) 

6: TNH/23 Purified bulk (4 ug) / non reducing conditions 
7: TNH/22 Purified bulk (4 ug) / non reducing conditions 
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Fig. 8 



Cell binding assay 
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Fig. 9 Inhibition of cell growth 
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